Chapter ? ? High - Dimensional Classification ∗

نویسندگان

  • Jianqing Fan
  • Yingying Fan
  • Yichao Wu
چکیده

In this chapter, we give a comprehensive overview on high-dimensional classification, which is prominently featured in many contemporary statistical problems. Emphasis is given on the impact of dimensionality on implementation and statistical performance and on the feature selection to enhance statistical performance as well as scientific understanding between collected variables and the outcome. Penalized methods and independence learning are introduced for feature selection in ultrahigh dimensional feature space. Popular methods such as the Fisher linear discriminant, Bayes classifiers, independence rules and distance based classifiers and loss-based classification rules are introduced and their merits are critically examined. Extensions to multi-class problems are also given.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Classification of Chronic Kidney Disease Patients via k-important Neighbors in High Dimensional Metabolomics Dataset

Background: Chronic kidney disease (CKD), characterized by progressive loss of renal function, is becoming a growing problem in the general population. New analytical technologies such as “omics”-based approaches, including metabolomics, provide a useful platform for biomarker discovery and improvement of CKD management. In metabolomics studies, not only prediction accuracy is ...

متن کامل

Design of an Adaptive Classification Procedure for the Analysis of High-Dimensional Data with Limited Training Samples

.......................................................................................................................................................... V CHAPTER 1: INTRODUCTION ..................................................................................................................... 1 CHAPTER 2: EFFECT OF SEMI-LABELED SAMPLES IN REDUCING THE SMALL SAMPLE SIZE PROBLEM AND MITIGATI...

متن کامل

Classification of High Dimensional Data

................................................................................................................... v CHAPTER 1: INTRODUCTION .................................................................................. 1 1.1 Background ................................................................................................... 1 1.2 Statement of the Problem ...........................

متن کامل

Classification of High Dimensional Data with Limited Training Samples

.. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv CHAPTER 1: INTRODUCTION ... . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1 1.1 Background ... . . . . . . . . . . . . . . . . . . . . . . . . . . . . ....

متن کامل

Bayesian Classification and Regresssion with High Dimensional Features

This thesis responds to the challenges of using a large number, such as thousands, of features in regression and classification problems. There are two situations where such high dimensional features arise. One is when high dimensional measurements are available, for example, gene expression data produced by microarray techniques. For computational or other reasons, people may select only a sma...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009